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© Method and apparatus for identifying words entered on DTMF pushbuttons. 



© The identity of a word entered on DTMF (Dual Tone Multi-Frequency) pushbuttons is determined by 
identifying letters according to their frequency in groups of 2s and 3s and up to n-letter groups. Initially, a sender 
will type a word on a DTMF pushbutton pad. Since each button represents three (3) possible letters, or four in 
the case of 7 (PQRS) and 9 (WXYZ), the system will look up the possible meanings for the word from an internal 
memory which includes a dictionary. If the word does not exist in the dictionary, then the system will search 
against a Forward and a Reverse Dictionary to guess at the identity of fragments from the beginning and ending 
of the word. Then it will guess the identity of each letter according to its relative frequency in groups of 3s 
known as Trigrams, or in groups of 2s known as Digrams. The Trigrams are preferably arranged in five (5) files 
to identify the first, second, third and final letters of a word as well as any non-specified middle location. The 
system can be extended to any letter grouping of size n where n is two or more. Once identified, the word is 
stored as part of a message and then transmitted as ASCII digital information over a packet data network to a 
remote receiver where digital to voice synthesis converts the message into audio. According to a simplified 
alternative embodiment, the words are guessed not letter-by-letter but word-by-word from a data bank in which 
'words are stored according to the length, numeric equivalent and relative frequency. Other delivery methods 
include alphanumeric pagers, FAX, E-Mail, telex, computer printed output, telegrams and cables. 
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METHOD AND APPARATUS FOR IDENTIFYINGWORDS ENTERED ON DTMF PUSHBUTTONS 



1 . Field of the Invention 

The invention relates to a method and apparatus for identifying words from letters entered on a DTMF 
5 pushbutton pad, and, once the word is identified, transmits the information in ASCII digital form on a packet 
transport network to a location near the receiver where it is converted into speech. 



2. Description of Related Art 

70 

The concept of trying to determine the identity of a word based upon the frequency of the letters in the 
word is generally known in the cryptanalytical art. An interesting discussion concerning the frequency of 
words in the English language is found on pages 18 and 20 of the March 1988 edition of the Atlantic. For 
example, it is noted that nine (9) English words comprise fully a quarter of all spoken words and that forty- 

15 three (43) words account for half of the words that we use in every day speech. However, insofar as can be 
determined, none of the prior art appears to teach or suggest a sophisticated word guessing method and 
apparatus that can be employed In the context of a communications system. 

The prior art also discloses some systems for using a DTMF keypad to transmit messages to a remote 
location where a speech synthesizer reproduces a word. Typical of such systems is the invention described 

20 in U.S. Patent No. 4,307,266 entitled COMMUNICATION APPARATUS FOR THE HANDICAPPED. Accord- 
ing to that invention, a sender enters the appropriate position for the letter of the alphabet to be 
communicated. A second entry identifies which one of the plurality of letters attached to the telephone set 
which decodes the two digit code entry and converts the two-digit code to a standard machine readable 
format which can be displayed on a standard output device such as an alphanumeric display or, for blind 

25 persons, a braille or similar output message. The sending party can transmit the message to the receiving 
party by keying characters into a keyboard. Keyed-in characters are converted to speech by a speech 
synthesizer for transmission to the receiving party. In other words, the sender types the word in DTMF code 
and the resultant word is reproduced at the receiving station by synthesized voice. While the foregoing 
system may be useful for certain applications, it seems to be rather cumbersome, limited in its vocabulary, 

30 and relatively inefficient in its transmission mode. 

Other patents which discuss use of DTMF codes for transmission purposes include 4,087,638; 
4,427,848 and 4,608,460. 

Some attempts have been made in the prior art to identify a word from a DTMF dial tone clue. A useful 
description of such techniques is found in an article entitled "Digital Techniques for Computer Voice 
35 Response: Implementations and Applications", Lawrence R. Rabiner, Fellow IEEE and Ronald W. Schafer, 
Senior Member, IEEE PROCEEDINGS OF THE IEEE, Volume 64, No. 4, April 1976. 

Of more general interest is an article entitled "Alphabetic Data Entry Via the Touch-Tone Pad: A 
Comment", Sidney L. Smith and Nancy C. Goodwin, The Mitre Corporation, HUMAN FACTORS, 1971, 13- 
(2) Page 189-190. 

40 U.S. application 4,737,980 entitled COMPUTER DATA ENTRY METHOD AND APPARATUS describes a 
device mounted near a telephone for identifying numbers and letters entered on telephone DTMF tone 
buttons. A very simple algorithm is employed to identify letters and numbers one at a time. The device, has 
the drawback that it must be mounted near the telephone, requires more keystrokes to correctly identify a 
letter and is prone to making more incorrect guesses than the present invention. 

45 U.S. patent 3,677,659 entitled TELEPHONIC DATA ACCESS AND TRANSMISSION SYSTEM describes 
a system for transmitting and accessing data through a standard telephone network. The system includes 
the activating of a key or sequence of keys wherein each key represents a plurality of letters on a 
telephone. The signals produced by activating the keys are compared to a data base having indexed entries 
of words or names. The data base preferably comprises a telephone directory so that an individual can look 

so up a name and get a telephone number. The invention places emphasis on Boolean exclusion which 
requires a significant amount of interaction with the caller to eliminate collisions. Accordingly, the system is 
much more difficult and inefficient than the present invention described in this disclosure. 

U.S. patent 4,614,841 entitled GEOGRAPHICALLY DISTRIBUTED MULTIPROCESSOR TIME-SHARED 
COMMUNICATION PROCESSING SYSTEM describes a packet transport-like system including intelligent 
terminals at both ends connected to independent data bases. The system otherwise does not appear to 
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teach or suggest a mechanism for efficiently guessing the meaning of words entered letter-by-letter on 
DTMF pushbuttons, converting the information to ASCII code, transporting the code over a packet transport 
network and converting the code into synthesized speech at the remote end. Similarly, U.S. patent 
4,685,125 entitled COMPUTER SYSTEM WITH TASKING describes a packet transport network that 
5 includes a plurality of nodes. 

U.S. patents 3,533,069 and 4,164,025 are of possible interest in that they disclose related directory 
assisted retrieval systems. 

U.S. patents 4,649,563 and 4,757,266 entitled respectively METHOD AND MEANS FOR ACCESSING 
COMPUTERIZED DATA BASES UTILIZING A TOUCH-TONE TELEPHONE INSTRUMENT and TELE- 

w PHONE SYSTEM FOR CONNECTING A CUSTOMER TO A SUPPLY OF GOODS describe related systems 
of possible interest invented by Bernard N. Riskin, the inventor of the present system. 

Of general interest also are the following items from the patent literature: U.S. Patents 3,778,553, 
3,967,273; 4,012,599; 4,191,854, 4,426,555; 4,532,378; 4,557,062; 4,585,908; 4,608,457 and 4,633,041 as 
well as the following British Patents: 1,320,895 and 1,320,896. 

75 Whether taken individually or as a whole, none of the prior art appears to suggest the novel apparatus 
and method set forth in this disclosure for efficiently identifying a word typed on a DTMF keyboard and 
converting that word into ASCII digital format so that it can be efficiently transported via a packet transport 
network to a remote location where it is converted from data to speech or otherwise delivered such as to a 
telex. 

20 

SUMMARY OF THE INVENTION 



25 Briefly described, the invention comprises a method and apparatus for determining the identity of a 
word entered on a DTMF pushbutton pad by identifying letters according to their frequency in groups of n 
letters where n preferably equals two or three but can be much larger. A word is typically entered on a 
DTMF pushbutton pad one letter at a time. Six (6) of the numbered buttons on the DTMF pad correspond to 
three (3) letters of the alphabet and button #7 is PQRS and button #9 is WXYZ. The number string is 

30 compared against a dictionary of words in internal memory. The system feeds back the guessed word is 
rejected by the sender or from the dictionary to the sender to determine if the guessed word is correct. If 
the guessed word is rejected by the sender or is not found in the dictionary, the system next looks in a 
forward dictionary file and a reverse dictionary file for matches and if no satisfactory match is made, then it 
searches for groups of 3 letters referred to as Trigrams. The trigrams are preferably broken down into five 

35 (5) files, namely: Trigram I, which is a file for guessing the initial letter of a word; Trigram 2, which is a file 
for guessing the second letter of a word based upon the first letter of a previously guessed and confirmed 
alphabetic output string; Trigram 3, which is a file for guessing the third letter of a string based upon the 
first two letters of a previously guessed and confirmed alphabetic output string; Trigram Middle, which is a 
file employed to guess at the identity of any middle letter based upon the known identity of the previous 

40 two letters of a three letter string; and, finally, Trigram Final, which is employed to guess at the final letter of 
a word based upon the known identity of the previous two letters. The reason for employing five (5) Trigram 
files is because the initial, second, third and final letters of a word have different frequencies of individual 
occurrences and different frequencies of three letter combinations than letters in other positions in words. 
Using five (5) different files takes maximum advantage of the knowledge of the subject string in terms of 

45 each letter in its position within the word and within the context of its surrounding letters and/or digits. 

If a Trigram record does not exist for a particular three letter string, then the system will default to 
analyzing groups of two letters referred to as Digrams. 

The system can also be expanded to letter groups of size n where n is greater than two or three. The 
larger the group size n, the greater the word guessing accuracy, but such a system also requires 

so substantial internal memory and substantial processing. 

After all of the words of the message have been identified and converted into ASCII code, the system 
transmits the information via a packet transport network to a remote location where the ASCII code is 
converted by a digital-to-speech synthesizer to an audio output understandable to a receiver at the remote 
location or to one of several other delivery modes. 

55 According to a simplified alternative embodiment of the invention, it is possible to identify a word at a 
time rather than a letter at a time. In this system a data base is generated in which the words are stored 
according to the length, numeric equivalent and relative frequency. An individual using the alternative 
embodiment, types in the word on a DTMF pushbutton pad and the system guesses at the word based 
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upon the information stored in the word guessing dictionary. The alternative system appears to work better 
for unique words, however there are a lot of short words that have the same numeric string. In contrast, the 
preferred embodiment of the invention combines both full word guessing with sophisticated letter-by-letter 
guessing and as a consequence operates faster. 
5 There are a variety of other applications for the basic word guessing technology that are discussed in 
detail in the Detailed Description of the Invention. 

The invention may be further understood by reference to the following drawings. 



io BRIEF DESCRIPTION OF THE DRAWINGS 



FIG. 1 illustrates an application of the telecommunication system according to the present invention 
in which two telephone access nodes are used to route a message over a packet transport network from a 
15 sender to a receiver. 

FIG. 2A is a general block diagram of a typical telephone access node. 

FIG. 2B is a detailed schematic illustration of the telephone access node illustrated in FIG. 2A as 
shown in the context of its SEND application . 

FIG. 2C is a detailed schematic illustration of the telephone access node illustrated in FIG. 2A as 
20 shown in the context of its RECEIVE application. 

FIG. 2D is a process chart illustrating the interactions of a sender and the prompter function while 
assembling a message on a telephone access node such as shown in FIG. 2A. 

FIGS. 3A-3F illustrate the steps of the letter-at-a- time word guessing process. 

FIG. 4A illustrates the method of generating Trigram files from dictionary files according to the 
3 preferred embodiment of the invention. 

Fig. 4B and 4C are tables of the format and sequence of records in the Trigram files. 

Figure 5A illustrates a simplified alternative embodiment of the invention and describes a process 
chart showing the interactions between a sender and the prompter function while assembling a message on 
a telephone access node. 

30 Figure 5B also describes the simplified alternative embodiment and shows a schematic diagram 

illustrating the memory organization of the word guessing dictionary of the access node. 

Figure 5C also describes the alternative embodiment and illustrates a table showing the four different 
modes that can be invoked using a standard DTMF multi-button pad. 

35 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 



During the course of this description like numbers will be used to identify like elements according to the 

40 different figures illustrating the invention. 

FIG. 1 is a general schematic diagram of the preferred embodiment of the invention 100 showing how a 
message can be sent from a sender in the northeastern United States to a receiver 22 in the southwestern 
United States. The message will be delivered by synthetic voice via an ordinary telephone. FIG. 1 also 
illustrates potential alternative routine via gateways to destinations anywhere in the world and via a variety 

45 of different delivery modes including Telex, cable, alphanumeric pager and electronic mail. 

Initially it is assumed that the sender 10 knows the telephone number of the nearest telephone access 
node 12 to him or her referred to herein as an entrance telephone access node. Entrance telephone access 
node 12 is connected by one or more telephone lines to a nearby packet transport network access node 
(PTN access node) which in turn is connected to a PTN (Packet transport network) 16. The telephone 

so access nodes 12 and 20, in addition to their PTN connections are also connected to telephone networks 
associated with the sender 10 or receiver 22 or both. Each telephone access node 12 or 20 preferably 
includes multiple connections to the telephone networks so as to be capable of conducting many dialogues 
simultaneously. The entrance telephone access node 12 functions to provide both entrance and egress to 
and from PTN 16 via PTN access node 14. Node 12 is labeled "entrance" in FIG. 1 and node 20 is labeled 

55 "egress" because it is the function that they perform for the purpose of illustrating the invention 100. 
However, it will be understood that both nodes 12 and 20 are identical to each other and, their roles could 
be easily reversed if it was desired to send a message from receiver 22 to sender 10. 

The caller or sender 10 initially dials up the closest telephone access node to him using the DTMF 
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, buttons 60 on his or her telephone instrument and the entrance telephone access node 12 will respond to 
the call. Telephone access node 12 then conducts a dialogue with the sender 10 concerning the preferred 
routing of the call. In this manner the sender 10 interacting with the entrance telephone access node 12 
determines the preferred delivery mode of the message. The delivery may be by synthetic voice through a 

5 telephone, Telex I, Telex II, foreign or domestic cable, alphanumeric pager, FAX, computer printout or by 
electronic mail (E-Mail). This is possible because the entrance telephone access node 12 includes routing 
tables 46 shown in FIGURE 2B which can route via multiple gateways and via multiple delivery modes. 

If the receiver 22 has a Telex I or II, the access node 12 requests the sender 10 to enter the Telex 
number and optional answer back. Then, this part of the initial dialogue is completed. If the delivery is to be 

10 via voice over the telephone, the entrance telephone access node 12 asks the telephone number of the 
receiver 22. The telephone number may represent a voice telephone instrument or a computer equipped 
with a modem if the delivery mode is dial-up E-mail. The E-mail option can also be accomplished by 
soliciting the network membership and mail box identification number of the receiver 22. If the receiving 
device is an alphanumeric pager, the dialogue will request the pager number. 

75 The dialogue can be further continued to request from the sender, requiring only "Yes" or "No" 
answers, his or her preference for options such as proof of delivery or other additional parties that he or she 
wants to receive the message. If additional parties are to receive the message, then the entrance telephone 
access node 12 will request additional Telex, pager or telephone numbers. 

The foregoing discussion assumes Telex routing is made possible by gateways between the packet 

20 transport network and the Telex network. Therefore, this discussion concentrates on routing to the receiver 
22 with a telephone through the use of synthetic voice. This discussion also assumes that if the message is 
to an alphanumeric pager or to other delivery methods, that this fact will be made known to the computer 
by an access means such as a special telephone number or by the dialogue between the sender and the 
synthetic voice function of the sending telephone access node. 

25 If the sender 10 enters a choice for a voice message the receiver's 22 telephone number, the entrance 
access node 12 will examine a table to determine the PTN access node 18 nearest to the egress telephone 
access node 20 serving that telephone number. The table is composed of telephone number NPA-NNX's 
and their associated approximate latitudes and longitudes. This table is referred to by the publisher, Bell 
Communiations Research, as the "V-H" File (Vertical-Horizontal). Given this table and the listing of all of the 

30 access nodes in the system 100, the entrance telephone access node 12 will determine the PTN access 
node 18 which is closest to the receiver's telephone access node 20 which is in turn closest to the 
receiver's telephone number. According to FIG. 1, the selected PTN access node 18 is the closest to 
egress access node 20 which is closest to receiver 22. The word "closest" here is meant to refer to the 
lowest cost telephone call charge. 

35 The entrance access node 12, having selected PTN access mode 18 as the exit PTN access node, will 
address all of the packets it creates to that PTN access node for referral to egress telephone access node 
20. This completes the entrance telephone access node 12 routing procedure for transmission from a 
sender 10 to a receiver 22 in the United States in the case of message delivery by synthetic voice. 

The system 100 also has the capacity to deliver messages on alphanumeric pager 29, or on a voice 

40 mail box 25, or by printed mail from hard copy printers 19 or may access data bases 21. 

FIG. 2A is a block diagram of a typical telephone access mode 12 which can function as either an 
entrance telephone access mode 12 or an egress telephone access node 20. A typical telephone access 
node 12 or 20 has multiple connections 30 to a standard conventional telephone network so as to be able to 
conduct many conversations with senders 10 and receivers 22 simultaneously. The standard conventional 

45 telephone network is not illustrated or discussed in detail in this disclosure since it is well known to those of 
ordinary skill in the art. The telephone access nodes 12 (or access node 20) also include a data link 32 to 
PTN access node 14 which in turn connects with PTN 16. 

In addition to its function of communicating with the standard telephone network and the PTN 16, the 
telephone access modes 1 2 and 20 also discharge two general applications with respect to sending and 

so receiving messages. These applications are described in greater detail in FIGS. 2B-2C and 3. The major 
functions supporting the sending and receiving applications are the prompter, speaker and dictionary 
functions. The prompter function "listens" to the DTMF tones sent by the sender 10 and guesses the 
meaning of the word entered by the sender 10. The speaker function relays to the sender 10 the guesses 
selected by the prompter function. The speaker function is performed in part by a conventional text-to- 

55 speech circuit board which can read the alphabetic data in the dictionary and pronounce the words in 
audible speech by interpretation of the characters contained in memory. Devices of this description are 
manufactured by Digital Equipment Corp., Texas Instruments and Natural Micro Systems. Software for this 
function is provided by Berkeley Speech Technology and others. The dictionary contains several thousand 

5 
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English words from which the prompter function selects its guesses. 

After completing the routing procedure, the entrance telephone access node 12 will invite the sender 10 
to type his or her message on the DTMF telephone buttons. The basic procedure followed by the sender 10 
is to type on the DTMF keyboard burtons 60 as if the letters inscribed thereon were actually functional. The 

5 actual information transmitted by the sender's telephone to the telephone access node 12 is, of course, only 
one of twelve distinct pairs of tones per each button depression. The letters employed by the system are 
those commonly inscribed on the DTMF telephone buttons 60 with the exception that the letter Q is 
assigned to the 7 button (PQRS) and the letter Z is assigned to the 9 button (WXYZ). The letters Q and Z 
are usually not found on conventional telephone dials or DTMF buttons 60. 

10 The pound sign (#) on the DTMF telephone buttons 60 represents, according to the preferred 
embodiment of the present invention 100, the space bar on a standard typewriter. Each time the sender's 
telephone transmits the tone combination for the # to the telephone access node 12, the telephone access 
node 12 will assume the sender 10 has finished typing a word. It will then guess the word and, using its 
speaker function, speak the word to the sender 10. 

J5 The prompter function has two clues which it uses to decipher a numeric string typed by the sender 10 
into an alphabetic word, namely the number of digits and the digit string. 

Initially, assume that the sender 10 types THE which the entrance telephone access node 12 receives 
as 843. The sender 10 then types the # after the 3, thus terminating the word and the prompter function will 
be in receipt of the string 843#. The prompter function contains a dictionary of a majority of the words in 

20 common usage in the English language in its random access memory. The sequence of the dictionary 
entries is by (1 ) number of digits in the word (word size) (2) string definition being the numeric equivalent of 
the alphabetic word emitted by typing on a DTMF pad, e.g. 343 for THE, and (3) possible meanings of the 
string sequenced by probability of use. According to the former example, the prompter function will contain 
an entry of word size 3 digits, the string 843, and, e.g. the words THE, TIE, and VIE which represent the 

25 three possible interpretations of the string 843 listed in the order of their frequency of use in the English 
language. The dictionary contains a list of words corresponding to each numeric combination (e.g., 843). 
The choice of words is based on their frequency and words will be carried in the dictionary until it is 
calculated that their frequency is below some predetermined threshold. This threshold is definable in 
expected occurrences per million words of text. 

30 An acceptable source of words and their frequencies is the AMERICAN HERITAGE WORD FRE- 
QUENCY BOOK by Carroll, Davies, and Richman published by Houghton Mifflin Company. The foregoing 
book deals with frequencies encountered in eight million words of text. 

According to the previous example, the most frequent interpretation expected of the string 843 is THE. 
The entrance telephone access node 12, using the speaker function, will speak the word THE to the sender 

35 10. If THE is the correct interpretation for 843, the sender 10 wil proceed to type the next word. If THE is 
not the correct interpretation for 843, the sender 10 will deny the guess by typing another #. The speaker 
48 will then speak the next most likely word, in this example, TIE, If the sender 10 denies TIE by typing 
another # sign, the speaker 48 will speak VIE. If the sender 10 denies VIE, then the access node will 
announce to the sender 10 that it is shifting modes and will decode the word by reference to the Forward 

40 Dictionary 43a, Reverse Dictionary 43b, Trigrams 43e-43i, and Digrams 43c and 43d. 

Given a particular numeric string such as 843, there are very few valid words that correspond to it. 
These valid words usually display widely divergent possibilities of use in English text. Therefore, guessing a 
word, given its size and numeric DTMF string, has been discovered to be an efficient and practical method 
for identifying words. 

45 The dictionary function can be easily contained in the conventional equipment manufactured for micro- 
computers. An uncompressed dictionary of some 300,000 words can be contained in a memory board of 3 
million bytes. In practice, some compression would be used, at a minimum to represent multiple forms of a 
verb in a single entry. For example, the word WALK, would represent walker, walking, walked, etc. 
Compression is useful because it minimizes computer memory. As the compression increases, the amount 

so of memory will decrease but the response time will increase. The cost of RAM memory has become 
sufficiently low so that it may be preferable to minimize or eliminate the compression so as to maximize 
response performance and minimize software effort. 

The foregoing has basically described the invention set forth in my co-pending application Serial No. 
07/126,630 entitled Packet Network Telecommunication System Having Access Nodes With Word Process- 

55 ing Capability" filed on November 30, 1987. The major difference between the invention of this disclosure 
and the disclosure of my co-pending application Serial No. 07,126,630 occurs where the word sought for 
does not appear in the internal dictionary. According to my co-pending patent application, the system would 
guess at each letter based upon that individual letter's individual frequency and feed that guess back to the 
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sender. The sender would either approve or disapprove of the letter guess until the correct letter was 
identified. According to the invention of this disclosure, the system includes additional information about the 
frequency of an individual letter within a given string of n letters where n is at least two (2) and is preferably 
three (3) or more so as to further enhance the capability of identifying the word. Moreover, once the word is 
5 correctly identified, it will be automatically added to the memory of the system so that a second occurrence 
of the word will be automatically guessed at the next time that it is entered by the caller. 
FIGS. 3A - 3E describe "Letter-at-a-Time" processing. 

The invention employs a dictionary 42 of the 18,000 most common words in the English language 
selected according to their expected occurrence per million words of text. The expected occurrence per 
10 million words of text is associated with each word in the dictionary. The number 18,000 is arbitrarily chosen 
as this is the point below which words have an expected occurrence below one per million words of text. 
The dictionary represents over 95% of the words commonly used in English text. 

If an input "DTMF numeric" string does nsot exactly match an entry in the 18,000 word dictionary, then 
the Letter-at-a-Time processing is performed. "DTMF numeric" refers to the numbers letters on the DTMF 
is buttons, for example the 2 button has the letters A, B and C; the 3 button has the letters D, E, and F and so 
forth. 

The primary goal of translating from an input DTMF numeric string to a letter string of an unknown (not 
in the original dictionary) word is to minimize the number of guesses to achieve the intent of the user. 
However, the penalty assessed against an incorrect guess must be optimized against the amount of 

20 dictionary data and their storage and processing requirements. 

The invention assumes that words which are not in the dictionary will have the same "structure" as 
words which are in the dictionary. This assumption applies to proper nouns as well as to common words but 
will not be as true for proper nouns as for common words. The reason for the diminished expectation of 
accuracy in processing proper nouns is that they are frequently of foreign origin whereas the dictionary is 

25 specifically based on English text. (Accuracy in processing proper nouns can be enhanced by including 
samples of proper nouns in the original dictionary taken from all possible ethnic derivations.) Structure is 
characteristic of a particular language and the structure involved in this discussion is peculiar to English in 
that the dictionaries are derived from English words. The general notions apply to any language but a 
different set of dictionaries is required for each language. 

so Given the expected occurrence per million words of text for each word in the dictionary, these words 
can provide the basis for the extraction of text "snippets" (letter sequences) which will have the same 
expected occurrences per million words as the words from which they are extracted. These snippets can be 
ordered and tabulated by their expected frequencies per million words of text to that for a given n-letter 
snippet, the expected frequency per million words of text for that snippet can be computed. 

35 Two or three letter snippets are referred to as "digrams" and "trigrams". However, it is not necessary 
to restrict the construction of snippets to digrams and trigrams. "N-grams" can be extracted from the 
original dictionary which may be only one letter less than the words from which they are extracted. For 
example, the word DICTIONARY may be the source for an n-gram of DICTIONAR in which n = 9. Also, 
taking the n-gram from the right side of the word will yield ICTIONARY which is also a 9-gram. 

40 The assumption is that letters occur in predictable frequencies, and that these frequencies are 
dependent in part on the position of a letter in the word. For example, the letter Q is more frequently found 
as the first letter of a word than as the last letter of a word. The particular letter positions addressed by this 
invention are the first, second, third, final, and "all-other" positions. Therefore, five sets of trigrams are used 
to guess the alphabetic interpretation of a DTMF numeric string. 

45 This process can be compared to cryptogram puzzles which appear in newspapers and to military 
cryptography. In a cryptogram puzzle, a given letter represents some other letter throughout the complete 
puzzle. In military cryptography, a given letter (or other symbol) substitutes for another letter but the 
substitution is always changing and changing randomly. In this invention, 2 always means A or B or C, 3 
always means D or E or F, etc. However a given digit, e.g., 2, can change its meaning from A or B or C 

so anywhere in any word - i.e., it does not have a constant meaning as is the case in a cryptogram puzzle. 

Summarizing the concept of "structure" as used in this context, a computerized dictionary of 18,000 
words each associated with an expected occurrence per million words of text is used as the primary word 
source and also as a source of two special dictionaries - a Forward Dictionary and a Reverse Dictionary, 
and of trigrams each also associated with expected occurrence per million words of text. The composition 

55 of the two special dictionaries and of the trigrams and the associated expected frequencies per million 
words of text are defined as the 'structure" of the language. Given knowledge of the structure of the subject 
language, the system can compute alphabetic guesses for DTMF numeric strings which are not part of the 
original 18,000 some words. The accuracy of the guess depends primarily on the length of the snippets 
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pieced together to construct the guess - the longer the snippet, the more accurate the guess. 

"Collisions" may occur. The word "collision" is defined to mean multiple alphabetic strings which 
translate to the same DTMF numeric string. A collision occurs when the system finds more than one 
alphabetic string corresponding to a given numeric string. The numeric string 66 is an example of a 

5 collision because it corresponds to ON or NO. Another example is 843 which corresponds to THE or TIE. 

We assume that the probability of a collision decreases exponentially as the number of digits in a 
DTMF numeric string matched to the numeric equivalent of an alphabetic string increases. Conversely, the 
probability that a match is the one and only one correct interpretation of a DTMF numeric string increases 
exponentially as the number of string digits increases. The probability that there is more than one 

to alphabetic string that is an acceptable (that such a word actually exists in some dictionary) interpretation of 
an input numeric string is approximately one in d (number of digits) raised to the third power of the number 
of digits. For a one digit match this is one chance in one, for a two digit match it is one chance in 2 to the 3 
power or eight, etc. The number 3 is selected as the exponent because there are three alphabetic 
characters on each numeric DTMF button except for the 7-PQRS and the 9-WXYZ. The letters Q and Z 

15 occur so infrequently in English text as to be negligible in estimating probabilities. This probability estimate 
ignores the redundancy in English language which tends to make the chance of success even more 
favorable). These probability estimates strongly suggest that the most successful system will be based on 
attempting to match the longest strings available from the Forward and Reverse Dictionaries derived from 
the original dictionary. 

20 The resolution of collisions between words found in the dictionary is achieved by guessing the 
interpretation of the numeric string in the sequence of the expected occurrence per million words of text. 
For example, for the string 843, THE is the first guess and TIE is the second guess because that is the 
sequence of their expected occurrence per million words. The same process is used when guessing letters 
based on trigrams. If there are several trigrams which might translate to the given string, the best guess is 

25 achieved by guessing in the sequence of expected occurrence per million words of text. The strategy for 
guessing with n-grams is to find the longest n-gram which matches the input numeric string. 

In the present embodiment of the invention, the system delivers a complete word to the caller via the 
text-to-speech unit if the caller's DTMF string exactly matches a dictionary entry or if the word can be 
pieced together by reference only to the Forward and Reverse Dictionaries. (These two dictionaries are 

so described in the next paragraph.) Otherwise, the system delivers the word to the caller one letter at a time, 
i.e., it spells out one guessed letter at a time moving from left to right. Various improvements are possible. 
For example, several alphabetic interpretations of an input numeric string may result from the process of 
piecing together a word from the multiple sources of string interpretations. These can be temporarily stored 
according to their probability and if the caller rejects a word, then the system can then present the next 

35 most likely word. If the caller rejects all of the pieced together words, the system can back off to a single 
letter at a time presentation. 

The Forward Dictionary contains the original 18,000 words arranged in DTMF numeric sequence and 
within identical DTMF numeric sequence by expected frequency per million words regardless of word size 
whereas the original 18,000 word dictionary is arranged primarily according to word size. (Note that the 

40 actual alphabetic character string does not participate in the sorting sequence.) 

The Forward Dictionary is constructed by repeatedly truncating the least significant letter from each 
word in the original dictionary until the remaining string is four letters long. For example, from the word 
DICTIONARY, the truncation produces DICTIONAR, DICTIONA, DICTION, DICTIO, DICTI, and DICT. Each 
of these snippets has its original expected occurrence per million words associated with it as copied from 

45 the figure for DICTIONARY. Each of these snippets is sorted and the frequency figure tabulated together 
with identical snippets taken from the same character position of other words. For example, the snippet 
DICT is tabulated with the snippet DICT taken from the word DICTATOR. The result is a single occurrence 
in the Forward Dictionary of each snippet associated with its tabulated occurrence. As the snippets get 
smaller, the frequency figures will increase. The smallest frequency will be associated with DICTIONARY 

so and the largest with DICT. The DTMF numeric is then computed for each snippet. For example, 3428 is 
computed for DICT. The tabulated snippets are sorted on the DTMF numeric so that the longest is 
encountered first in the searching. If several snippets exist with identical DTMF numerics, the snippet with 
the highest frequency is caused to be sorted first within the identical numerics. If the searching fails against 
the longest snippet, then the searching will try the next smallest and so forth until a match is made or until 

55 the search string becomes smaller than four digits. 

Several variations can be applied to the above procedure. The truncation can terminate at five or six or 
more letters instead of at four. This would cause the strategy to turn to trigrams sooner than the present 
plan. The snippet DICT from DICTIONARY could be tabulated with the snippet DICT from INDICT, i.e, from 
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a different letter position. (This suggests the construction of n-grams.) The sorting could discard multiple 
snippets with identical DTMF numerics except for the first-trie one with the largest frequency value. This is 
appropriate to the strategy of making only one word-at-a-time guess and then using trigrams and digrams to 
construct letter-at-a-time guesses. 
5 The Reverse Dictionary also contains the original 1 8,000 words. The entries are reversed so that the 
word DICTIONARY for example, appears as YRANOITCID and the corresponding DTMF numerics are also 
reversed. The sequence is by reversed numeric and within identical DTMF numeric by expected frequency 
per million words. The construction of the dictionary is the same as the Forward Dictionary except that the 
letters are first reversed. 

w The trigram tables are built similarly to the Forward and Reverse Dictionaries with several variations. 
There are five tables, one for the first, second, third, final, and "middle" digit guessing. They are used for 
guessing one letter moving from left to right so the assumption is that there are two previously confirmed 
letters preceding each digit to be guessed. Thus, the search argument format for e.g, a Trigram M (middle) 
record is AAD meaning two Alphas which are known and one Digit which is to be guessed. The D in this 

rs example is a "middle" digit - not the first, second, third, or final. The function returned by the search is 
three guesses for the D listed in the sequence of frequency, highest first. In case there is no such record 
corresponding to an AAD, there will be less than three records returned from the search. An example is 
ZZ9 which would probably return no records at all. ZZ2 on the other hand, would return at least ZZA taken 
from MOZZARELLA if this word were in the original dictionary. ZZB and ZZC would probably not be in the 

20 table. If less than three records (one for each letter on the DTMF button or four in the case of 7 and 9) are 
returned, the Digrams are used to make up the difference. 

Each of the five tables is constructed by extracting three letter snippets from the original dictionary. The 
snippets are included in one of the five files depending on the character position of the word from which 
they are extracted. Snippets taken from characters 1,2, and 3 of a word are sent to Trigrams Initial. 

25 Snippets from 2,3, and 4 are sent to Trigrams 2. Snippets from 3, 4, and 5 are sent to Trigrams 3. Snippets 
taken from the final letter of the word and the two preceding letters are sent to Trigrams Final. All other 
snippets are sent to Trigrams Middle. These are then sorted, the frequencies are tabulated, DTMF numerics 
generated and sorted again by DTMF numerics and within identical numerics by frequency, and stored in 
tables similarly to the Forward and Reverse Dictionaries. 

30 There are some variations from the above procedure applied to building the Trigrams Initial and 
Trigrams 2 tables. For Trigrams Initial, it may be that the Reverse Dictionary search was provided one or 
two letters useful in the Trigrams Initial Search. The one letter could be the third letter of the word. The two 
letters could be the second and third letters of the word, therefore, the search argument formats could be 
DDD (no letters known of the first three in the word), DDA (the third letter is known from the Reverse 

35 Dictionary search), or DAA (the second and third letters are known from the Reverse Search.) The functions 
returned from the search will be formatted properly to respond to these search arguments. Naturally, the 
expectation of a proper guess for the D (the first digit of the Trigrams Initial record) will be more accurate if 
one or two letters are part of the search argument (DDA or DAA) than if the argument is DDD. 

For Trigrams 2, it may be that the Reverse Dictionary search has provided the third letter of the word. 

40 (If the Reverse Dictionary search also provided the second letter of the word, then it would not be 
necessary to reference Trigrams 2). If the third letter is available, the search argument will be ADA, 
otherwise it will be ADD. (The first letter has been confirmed by the user after presenting him with the 
results of the Trigrams Initial search.) Similarly to the Trigrams Initial search, the Trigrams 2 search will be 
more accurate with the search argument format ADA than with ADD. 

45 As illustrated in FIGS. 3A - 3E, this invention uses the maximum length string available from a Forward 
Dictionary search, combining this with the maximum length string available from a Reverse Dictionary 
search, and then using forward trigrams and digrams to bridge (chain) the gaps which may remain after 
combining the forward and reverse alphabetic strings. The forward and reverse dictionary searching extracts 
prefixes and suffixes as a by-product" of the searching. 

so The results of the combination of the two strings can be longer than the input string, shorter than the 
input string, or the same size as the input string. If the combined strings are longer than the input string, 
then the "overlap" letters may or may not match. If the overlap letters match, then this is further evidence 
that the correct and only correct interpretation of the input numeric string has been found. If the overlap 
letters do not match, the two alphabetic strings concatenated together may still be the correct interpretation 

55 of the input numeric string. If one or more letters overlap, the overlap letters are unequal, and the strings 
are not the same size, the overlap letter is chosen from the longer of the two strings. If the strings are the 
same size, the overlap letter is chosen by reference to a trigram or digram table to select the most likely 
alphabetic according to the frequency data. If the combined strings are the same size as the input string, 
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, then there are no overlap letters. If the two matched alphabetic strings are shorter than the input numeric 
string, then the gap between the forward string and the reverse string will be filled in by reference to 
trigrams and digrams. 

Referring to FIG. 3A, the process is performed if the search against the original dictionary (the 18,000 

5 words) fails. The first search in FIG. 3A is made against the Forward Dictionary. Since the original search 
against the 18,000 words is known at this point to have failed, this search begins with truncating the last 
digit from the input string and attempting to find a match against a Forward Dictionary entry one character 
shorter than the input string. 

It will be useful to follow the process with an example. Suppose that the input numeric string is 

io 34726837437 intended to mean DISCOVERIES and that DISCOVERIES is not one of the 18,000 words. The 
words DISCOVER and CHERRIES are assumed to be contained in the 18,000 words and are thus also in 
the Forward Dictionary and the Reverse Dictionary. At circle #1 , the final 7 is truncated from the input string 
and the size of the now truncated string is computed as S = 10. The Forward Dictionary is searched with the 
argument string size of 10 but no match is found on the 10 digit string. The string size is reduced by one 

75 and the next least significant digit, 3, is truncated from the tail of the numeric string. Searching against the 
Forward Dictionary continues until a match is found when S = 8 and the alphabetic interpretation is 
DISCOVER. DISCOVER is saved and the process continues at circle #2. 

The string is reversed so that it is now 73473862743 and it is in the proper format to match against the 
Reverse Dictionary. As in the Forward Dictionary search, the string is truncated and the search is made with 

20 size Q. This is repeated until a match is made when Q = 4 against 7347 which corresponds to RIES, the last 
four letters of CHERRIES (which is entered in the Reverse Dictionary as SEIRREHC). 

Reference is made to prefixes in the Forward Dictionary search and to suffixes in the Reverse 
Dictionary Search. There is an auxiliary dictionary of some 40 prefixes (e.g., anti-, circum-hyper-) and some 
80 suffixes (e.g., -hood, -meter, - ious). If a search produces an exact match against a prefix or suffix, then 

25 the prefix or suffix is truncated from the input string and the search is repeated with a new argument string. 
This allows the system to decode words such as ANTIDISESTABLISHMENTARIANISM and CIRCUMNAVI- 
GATE, even though they may not be in the dictionary per se. 

In FIG. 3B, if the system can guess an alphabetic interpretation of the complete numeric string using 
only the Forward and Reverse dictionaries with n at least four for both strings, then the system will guess 

30 the word and speak it to the caller as if it had been found in the original dictionary. If the system requires 
trigrams and/or digrams to guess the alphabetic interpretation of the numeric string, then it will speak the 
word to the caller a letter at a time. 

In FIG. 3C, the results of the Forward Dictionary search and the Reverse Dictionary search are 
assembled and seen to be DISCOVER and RIES. Since S=8, Q=4, and the input string is 11, there is a 

35 one digit overlap. This overlap is the digit 7 which is equated to the letter R, the final letter of DISCOVER 
and also to R, the initial letter of RIES. Since the overlapping letters are the same, the assumption is made 
that the numeric string has been successfully interpreted and the system will speak the word DISCOV- 
ERIES to the caller. 

In FIG. 3C at circle #5, the process of filling in gaps in the results of the Forward and Reverse 
40 Dictionary searches is begun. 

FIG. 3D - 3E diagrams the process used if the Forward and Reverse Dictionaries did not yield a guess 

for the input numeric string. At circle #7, the first unsolved digit is located and its digit position denoted. 

The digit position determines which of the five Trigram files will be used to decode it. If the Trigram Files 

do not contain sufficient entries to provide a full Guessing stage for each digit, then the final default, the 
45 Digram File is used to finish loading the stage. Since there are three letters per DTMF button, (except for 7- 

PQRS and 9-WXYZ) there must be three guesses to offer the caller per each button push he enters and 

they must be presented in the sequence of their likelihood as a successful guess. The Guessing Stage 

provides the mechanism for presenting each possible letter per numeric button in its proper sequence and 

for presenting the letters once and only once, 
so The guessing process of FIGS. 3C-3F is one letter at a time whereas the guessing of FIG. 3A-3B is a 

complete word using the assumption that a complete word guess had been successfully constructed from 

the Forward and Reverse dictionaries. 

The following is an example of the letter-at-a-time process for deciphering an known DTMF numeric 

string by reference to the Forward and Reverse Dictionaries derived from known dictionary entries. 

55 



10 



EP0 319 193 A2 



5 



15 



20 



40 



45 



50 



55 



Example 1 

The length of the combination of the matched forward string plus the matched backward string is 
greater than or equal to the length of the input string. 

UNKNOWN STRING 72887329 (intended to be SATURDAY) 



72823 SAUC(e) 

>° FORWARD DICTIONARY 728876 SATUR(n) 

72887 SATU(rn) 

7238 SAT(urn) 

7297 RAYS 



92368 (t)ODAY ( reversed- YADOT) 



REVERSE DICTIONARY 923738739 (y) ESTERDAY (reversed=YADRETSE) 

924 (h)AY (reversed=YAH) 



25 Step 1 : Match the unknown string against the longest string available from the Forward Dictionary 

which is 72887 of SATUR{n). 

(\gsults of forward match: 

30 UNKNOWN STRING 72887 matched, 329 unmatched 

I'ORWARD DICTIONARY 72887 SATUR(n) 



35 Step 2: Reverse the digits of the unknown string. 

Results of reversal of the digits of UNKNOWN STRING: 92378827 

Step 3: Match reversed digits against the longest string available from the reverse dictionary. 



^jsults of reversed digit match: 

tfWKNOWN STRING 9237 matched, 8827 unmatched 

REVERSE DICTIONARY 9237 (YADRestsey) 

Step 4: Reverse to the forward direction those digits that matched the REVERSE DICTIONARY 
together with the corresponding letters. 
Results of reversal: 7329 RDAY 

Step 5: Concatenate the forward matched string with the reverse matched string. Concatenate the 
corresponding letters of the forward matched string with the corresponding letters of the reverse matched 
string. The overlaps are indicated in the results below by the vertical bars (|). 

Results of concatenation: 

forward String: 7288 |7| SATU|R| } 

Reverse String j 7 1 329 | R j DAY } SATURDAY 
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Step 6: Note the overlapping digits (inside the vertical bars) between the two string and note whether 
the corresponding letters are the same. If the digits are the same, no further processing is necessary. 

If the digits are not the same, one or the other must be chosen for the guess. (This example of 
SATURDAY will be used even though the 7 digit represents an R in both strings). 

If the strings are of different lengths, take the letter from the longest string, in this case, the SATUR. If 
the strings are the same lengths, refer to the expected frequencies per million associated with the two 
strings and choose the letter from the string with the highest frequency. (Recall that the n-grams are stored 
together with their expected occurrence per million words of text. 



Example 2 

The length of the combination of the matched forward string plus the matched backward string is less 
75 than the length of the input string. In this case, there are one or more unmatched digits. These are 
processed by further reference to n-grams until the length of any substring is less than four digits. Then the 
remaining unmatched substrings are decoded by reference to trigrams and digrams. The trigrams will 
usually decode any unmatched digits (except for foreign proper nouns and words) and the digrams are the 
final default reference. The digram table contains every possible combination of two letters. 
20 The basic dictionary building system illustrated in FIGS. 4A-4C is used to build the following 
dictionaries. 

The basic system illustrated in FIG. 2B and described in FIGS. 3A-3F includes a group of dictionary 
files 43a-43n which are discussed in detail as follows: 

25 

Original Word Guessing Dictionary 42 

This is the original set of 18,000 some words in ascending sequence of size, DTMF numeric translation 
and within DTMF numeric translation sequence in descending order of their expected occurrences per 
30 million words of text (EM). For example, the words DOG and FOG would appear in the section of the 
dictionary for three letter words as: 
364, 23, DOG 
364, 14, FOG 

ssuming that 23 and 14 were the EM associated with DOG and FOG espectively. 



Forward Dictionary 43a 

This is the original set of 18,000 some words in ascending sequence of DTMF numeric translation and 
40 within their DTMF numeric sequence in descending order of their expected occurrences per million words 
of text (EM). Word size is secondary to DTMF in this dictionary whereas it is the prime sequencing criteria 
for the original dictionary. The words are sequenced the same way as they would be in an ordinary 
dictionary with the exception that they are sequenced on DTMF numeric instead of the actual alphabetic. 
The sequence is arranged so that longer strings taken from a given word fall before shorter strings taken 
45 from the same word. For example, DICTIONAR (the numeric is 342846627) would all before DICTIONA (the 
numeric is 34284662). The sequencing of longer strings before shorter strings is what allows the longest 
string possible to be matched against the Forward Dictionary. For equal strings, e.g., two strings of 
34284662 which have different alpha translations, the string with the highest EM falls first. 

The Forward Dictionary file is constructed by taking each word in the Original Dictionary and 
so successively truncating the final character to make a new record until the new record is four characters 
long. For example, the original entry DICTIONARY would provide the records DICTIONAR, DICTIONA, 
DICTION, DICTIO, DICTI, and DICT. 

After each word is taken and truncated and sent to a file together with its EM, the records with equal 
DTMF numerics are tabulated and their EM are added together to produce the total EM for that alphabetic- 
55 numeric string combination. Thus, if there are two strings, 34284662 each with a different alphabetic 
equivalent, the EM associated with each string are the total EM taken from all strings with the same 
alphabetic and thus the same numeric. For identical numeric strings with different alphabetic strings, the 
string with the highest EM occurs first. 
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Reverse Dictionary 43b 

This is the dictionary obtained by reversing the letters of all the words of the Original Dictionary and 
then continuing to produce the dictionary in the same way as the Forward Dictionary. 

5 

Trigram Files 43e-43i 

There are five trigram files: TRIGS 1 43e; TRIGS 2 43f; TRIGS 3 43g; TRIGS M 43h and TRIGS F 43i; 

70 also referred to respectively as: Trigrams Initial; Trigrams 2; Trigrams 3; Trigrams Middle and Trigrams 
Final. The reason for using five files is that tetters have different expected frequencies of occurrence 
depending on their positions in a word. The particular letter positions considered to have special 
frequencies associated with them are the first, second, third, and final positions in a word. These four 
positions have special trigram files for guessing unknown digits in these positions and there is a fifth file for 

is 'all other" digit positions. 

The format of a search argument into a trigram file is usually AAD where AA represents two known 
alphabetic characters and D represents an unkown digit which is to be guessed by reference to the trigram 
files. The AA are known because the guessing proceeds from left to right when using the trigram files and 
the letters are presented to the caller one at a time for confirmation or denial. 

20 Trigrams Initial 43e and Trigrams 2 43f have special search arguments because for these two files, 
there are not two known alphabets available to use as the search argument. These two files may be used in 
searching input DTMF strings which have previously matched the Reverse Dictionary (but not the Forward 
Dictionary) and possibly picked up one or two alphabetic characters in the second and third character 
positions. If no alphas were retrieved by the Reverse match, then the format of a search argument against 

25 the Trigrams Initial File 43e is DDD, i.e., three numeric Digits because no alphas are known. If the third 
character was retrieved by the search against the Reverse Dictionary, then the search argument format is 
DDA (two unkown digits and one known Alphabetic). If the second and third characters were retrieved by 
the search against the Reverse Dictionary, then the search argument format is DAA (one unknown Digit and 
two known Alphabetic). 

30 Trigrams 2 43f may have a search argument format of ADD if no character was located in the Reverse 
search or ADA if the third character was matched in the Reverse search. 

The sequence of the trigram files 43e-43i is numeric or alphanumeric search argument as above, 
totalled EM, and three or four guesses for each DTMF digit depending on whether the digit is a 7 or 9 (four 
guesses) or otherwise (three guesses). 

35 

Digram Files 43c and 43d 

The Digram File 43c and 43d is a set of two letter pairs which combine each letter of the alphabet with 

40 every other letter of the alphabet sequenced according to their expected appearance on the DTMF keypad. 
There are 26 times 26 entries which is every possible combination of two letters. The Trigram files 43e-43i 
will not necessarily have an entry for every possible letter combination but the Digram File 43c and 43d 
does. It is the fall-back tool used to guess the meaning of a DTMF digit if the Original Dictionary 42, the 
Forward Dictionary 43a, the Reverse Dictionary 43b, and the trigram files 43e-43i do not provide adequate 

45 input to enable a guess for a DTMF digit. The Digram Files 43c and 43d are produced from an independent 
data source, not from the Original Dictionary 42. 

The format of a search argument to the Digram File 43d is AD where A is a previously confirmed 
Alphabetic character and D is an unknown Digit from 2 to 9. The functions returned from searching Digram 
File 43d are the guesses for the unknown digit. There are three guesses for an unknown digit listed in order 

50 of their expected frequency. For example, for the argument T4, the search of the Digram File 43d yields 
three records in the sequence TH, Tl, and TG. (The digits 7 and 9 are expected to return four digits 
because there are four letters on these two buttons.) The Trigram files 43e-43i may not yield a complete 
"set" of guesses for a digit, e.g., H, I, and G, for the T4 search but the Digram files 43c and 43d are 
constructed to contain an exhaustive listing for every possible interpretation of the unknown digits. This is 

55 why the Digram Files 43c and 43d are referred to as the "final default". 

There is a special Digram File 43c with 26 entries called Digram Initial. This is used for the first letter of 
the word when there is no alphabet to supply the A of the AD format. The search argument is just D. The 
Digram Initial File 43c (as well as the other Digram File 43d) are expected to be used infrequently as the 
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system first refers to the Forward Dictionaries and the Trigrams 43e-43i. 



N-GRAM FILES 43j-43n 

5 

N-Gram-lnitial File [N-Gram 1) 43j 

The first letter of an entry in the N-gram Initial File 43j is the first letter of the dictionary entry from 

jo which it was extracted. "Initial" refers to the letter position in the word from the n-gram was extracted. 
"Initial" is the first letter of the word. Separate n-gram 43j-43n files are maintained for five different letter 
positions because the frequency of occurrence of a letter depends in part on its position in a word. N-gram 
files 43j-43n are maintained for n = initial, second, third, final, and other than these four. Therefore, there are 
five n-gram files in addition to the original dictionary plus the reverse dictionary. There is also the digram 

15 file which is used as the last resort default in decoding a digit string. 

The last letter of an entry in the N-Gram Initial File 3j is the next-to-the-last letter of the dictionary entry 
from which it was extracted. The last letter is not included because the search against the original dictionary 
containing the entry, e.g., SATURDAY, has apparently failed or the system would not be using n-grams. 
The EM is taken from the dictionary entry. 

20 Example: The n-gram SATURDA is extracted from the dictionary entry SATURDAY. The EM of SATURDAY 
is, e.g., 243, which is also applied to the n-gram SATURDA. THe n of the n-gram SATURDA is 7-gram I (I 
meaning "Initial"). SATURDA is a seven letter n-gram whose first letter is the first letter of the dictionary 
entry from which it was extracted. 

N-grams are tabulated together with duplicate n-grams having the same letter position and their EM are 

25 tabulated. If another original entry in the original dictionary was SATURDAY'S and the last three characters 
(Y'S) were truncated in the process of generating n-grams, then this SATURDA would be compared to the 
first SATURDA, found to be equal, and the EM of the two occurrences of SATURDA added. If the EM of 
SATURDAY'S were 101, then the tabulation process would yield a single entry of SATURDA and its EM 
would be the sum of 243 and 101 = 344. 

30 The process of generating entries for the N-Gram-lnitial File 43j continues with successively truncating 
the last letter of each original dictionary entry. For example, the successive truncation of SATURDAY yields 
SATURDA, SATURD, SATUR, SATU, and SAT each with the associated EM of SATURDAY. The process 
stops when the remaining letter string is three characters long. Each entry stops resulting from the 
truncation process is tabulated against identical letter strings. As the letter strings get shorter and shorter, 

35 e.g., from SATURDA to SAT, the EM totals get higher and higher. 

The final operation in creating the n-gram files 43j-43n is to generate the DTMF numeric equivalent 
from the letters, sort on the numeric equivalents in ascending order and the summarized EM in descending 
order. The alpha does not take part in the sort as it is now unique (only one occurrence of a given alpha 
string) because of the tabulation. There would be perhaps five entries having the same numeric equivalent, 

40 728, of which SAT was one. They would appear in the N-Gram Initial File 43j as: 
728, 564, SAT 
728, 521, SAU 
728, 493, SAV 
728, 447, RAT 

45 729, 321, PAT 

in the decoding process, if it was required to decode the string 728 and 728 were the initial three digits 
in the word, then SAT, SAU, SAV, RAT, and PAT would be the most likely interpretations of 728 in order of 
their expected EM. The attempted decoding of 728 would only occur after failing to decode the longest 
possible string in the input numeric. In the case of SATURDA for which the input numeric string is 7288732, 
so the decoding process attempts to decode 7288732, then 728873, then 72887, then 7288, and lastly 728. 
The n-gram files 43j-43n are organized to facilitate the "longest string first" search. 



N-Gram-2 File 43k 

The first letter of an entry in the N-Gram-2 File 43K is second letter of the dictionary entry from which it 
was extracted. For example, from SATURDAY, the first entry in the N-Gram-2 file 43k is ATURDA. The 
entry is successively truncated as in 1.6 above yielding ATURD, ATUR, and ATU each with the associated 
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, EM of SATURDAY. These entries are tabulated and sorted as in 1.7 above to yield the N-Gram-2 File 43k. 



N-Gram-3 File 43! 

5 

The first letter of an entry in the N-Gram-3 file 431 is the third letter of the dictionary entry from which it 
was extracted. For example, from SATURDAY, the first entry in the N-Gram-3 File 431 is TURDA. The entry 
is successively truncated as in 1.6 above yielding TURD, and TUR each with the associated EM of 
SATURDAY. These entries are tabulated and sorted as in 1.7 above to yield the N-Gram-3 File 43I. 



N-Gram-M File 43m 

The first letter of an entry in the N-Gram-M <"M" means "middle of the word") File 43m is any letter 
75 which is not the first, second, third, or last letter of the dictionary entry from which it was extracted. For 
example, from SATURDAY, the first entry in the N-Gram-M File 43m is URDA. This is truncated to yield 
URD. The next entry is RDA. Each entry is associated with the EM of Saturday. These entries are tabulated 
and sorted as in 1.7 above to yield the N-Gram-M File 43m. 

20 

N-Gram-F File 43n 

The last letter of an entry in the N-Gram-F ("F" means "final letter of the word) file 43n is the final letter 
of the dictionary entry. The entries are stored in reverse order. This makes them available for retrieval from 

25 the right side of the word moving in the direction from right to left. For example, from SATURDAY, the first 
entry in the N-Gram-F File 43n is YADRUTA. The entry is successively truncated yielding YADRUT, 
YADRU, YADR, and YAD, each with the associated EM of SATURDAY. These entries are tabulated and 
sorted as in 1.7 above to yield the N-Gram-F File 43n. 

FIGS. 4A, 4B, and 4C illustrate in detail the method by which Trigrams are generated and employed In 

30 accordance to the preferred embodiment of the invention 200. 

The improvement 200 lies in the method and apparatus for using tables of 3 letter groups called 
Trigrams and tables of 2 letter groups referred to as Digrams. The tables yield letter guesses which are 
spoken to the user 10 one letter at a time. The guesses can be denied by the user by pressing the # button 
or confirmed by taking no action. 

35 An example of the manner in which the system is employed is described below. Initially, assume that 
the sender 10 wishes to send the word RICHARD to the receiver 22. Further, assume that the word 
RICHARD is not found in the dictionary 42 so that the system 100 switches to the letter mode. The string 
from the DTMF pad is 7424273. 

Assuming that the match against the Forward and Reverse Dictionaries failed, the system 100 initially 

40 attempts to identify the word RICHARD using five (5) Trigram files referred to respectively as: TRIGS I 
(Trigrams I); TRIGS 2 (Trigrams 2); TRIGS 3 (Trigrams); TRIGS M (Trigrams Middle); and TRIGS F 
(Trigrams Final). 

TRIGS I is used to guess at the first letter of a string based on the first three digits of a word. The 
argument of a TRIGS I record is the first three numeric digits corresponding to the first three letters of a 
45 word. The function of a TRIGS I record is a guess for letter #1 of the subject string. In this case, the 
argument of 742 for four TRIGSI records yields functions of e.g., R,S,P,Q which are the possible guesses 
for the 7 listed in order of their expected frequency in English. TRIGS I was derived from a study of over 
eight million English words. 

The table was processed so as to add the numeric for the Trigrams. For example, the entry RIC (the 
50 first three letters of RICHARD) was processed to generate 742R-107.9. The 742 is the DTMF numeric 
equivalent of RIC. The 107.9 is the frequency of RICHARD per one million words of English text. 

The Trigrams were sorted on the numerics (742) and reverse sorted on their frequencies (to make the 
larger numbers fall before the smaller frequencies - e.g., 107.9 above) and tabulated to derive TRIGS I. 

An entry of TRIGS I has the format nnnXf where nnn is the numeric string as in the example of 742 
55 above. The X is the guess for the first digit of nnn as the R in the example of 742 R above. The f is the 
summation of all initial strings 742 for which the 7 was found to be the letter R. 

An entry may be missing if it was so infrequent as to be missing from the original table of English 
Trigrams. For example, 742Q may not exist There is a procedure for missing entries. If a single entry out 
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of a possible three or four (the number of letters on a DTMF button) entries is missing, it is assumed to be 
the least frequent of the three or four possible entries. If one or more are missing, the system defaults to 
using the DIGRAMS table described below to supplant the missing entries expected from the Trigram 
tables. 

5 TRIGS 2 is used to guess at the second letter of a string based on the first letter of a previously 
guessed and confirmed output string combined with the next two digits of the input strings. TRIGS 2 
records have arguments of one letter plus two numeric digits. The TRIGS 2 function is a letter guess for 
letter #2 of the subject string (7424273). In this example, searching TRIGS 2 on R42 would yield three 
records with functions of l,H, and G which are the possible guesses for the 4 listed in the sequence of their 

io expected frequency (occurrences per million words of text) in the English language. The expected 
frequency is included in each record. This is the frequency figure taken from the word from which the 
Trigram was obtained, i.e., in this case, RICHARD and Tabulated together with other identical letter strings. 

TRIGS 3 is used to guess at the third letter of a string based on the first two letters of a previously 
guessed and confirmed output string combined with the next digit of the input string. TRIGS 3 records have 

js arguments of two letters plus one numeric digit. The TRIGS 3 function is a letter guess for letter #3 of the 
subject string (7424273). In this example, searching TRIGS 3 on RI2 would yield functions of e.g., C.A.B 
which are the possible guesses for the 2 listed in order of their expected frequency in English and the 
expected frequency figures. 

TRIGS M is used to process all the digits in the input string after digit 3- i.e., beginning at digit 4 and 

20 continuing to the next to the last digit of the input string. 

Subsequent processing of 7424273 after referring to TRIGS I, TRIGS 2, and TRIGS 3 is accomplished 
by "moving a three digit window" over the input numeric string, one digit per move, from left to right and 
using the new digit plus the previously guessed two letters to guess the new digit by reference to TRIGS M. 
In this example, the output string is RIC after reference to TRIGS 3 and the next digit to be processed 

25 is the 4 (meaning the H). The argument is IC4 and TRIGS M will yield the functions of e.g., H,I,G which are 
the possible guesses for the 4 listed in order of their expected frequency in English. The output string is 
then RICH and the next digit to be guessed is the 2 (meaning the A). This continues until the next to the 
last digit has been processed. 

The reason for employing five files: TRIGS 1, TRIGS 2, TRIGS 3, TRIGS M, and TRIGS F is because 

30 the initial, second, third, and final letters of words have different frequencies of individual occurrence and 
different frequencies of three letter combinations than the letters in other positions in words. Using five 
different files takes maximum advantage of the knowledge of the subject string in terms of each letter in its 
position within the word and within the context of its surrounding letters and digits. 

When the system guesses using TRIGS I, it knows that the letter to be guessed is the first letter of the 

35 word. 

When the program guesses using TRIGS 2, it known one letter and two numeric digits and it knows that 
the letter to be guesses it the second letter of the word. 

When the program guesses using TRIGS 3, it knows two letters and one digit and it knows that the 
letter to be guessed is the third letter of the word. 
40 When the program guesses using TRIGS M, it knows two letters and one digit and it knows that the 
letter to be guessed is after the third letter in the word but not the final letter of the word. 

When the program guesses using TRIGS F, it knows two letters and one digit and it knows that the 
letter to be guessed is the final letter of the word. 

The assumption inherent in the use of the Trigram files is that unknown words (words not in the 
45 dictionary) will have the same characteristics as those words which are in the dictionary. These are the 
characteristics of letter frequencies and furthermore, frequencies depending upon position within a word. 
The choice of a guess for an unknown digit is primarily constrained by the restriction of the relation of the 
letters to the digits on the DTMF dial. For example, if ft is required to guess the letter or the digit "8", then 
it is known that the letter must be one hosen from the set of T,U, and V. The employment of the Trigram 
so files greatly enhances the probability that the choice will be the correct one. 

FIGS. 4A diagrams the process of creating the Trigram files from the dictionary. The dictionary contains 
some 18,000 words in the English language together with their expected occurrence per million words of 
text. Words of four or less letters are excluded from the Trigrams population as they are all assumed to be 
included in the dictionary whereas the Trigram files are created for the purpose of guessing words which 
55 are not in the dictionary. 

In the example of FIG. 4A, the word DISCOVERY is chosen to illustrate the process of creating the 
Trigram files. 

The Trigram I Records are generated for the purpose of guessing the first letter in the string. The first 
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three letters are converted to their DTMF numerics-i.e., the DTMF button on which they occur. In this case, 
DIS is converted to 347 which is stored in the Trigram I record. Then the first letter of the word, D, is 
inserted into the record. Then the expected occurrences per million words of text (265.8) is added to the 
record. The 347 is the argument and the D is the function. The dash is shown to illustrate what is known 

5 versus what is to be guessed when the record is referenced. The known data (e.g., the 347 or the 34S or 
the 3iS) is to the left of the dash and the data to be guessed (e.g., the D) is to the right of the dash. The 
use of the expected occurrences per million words of text (e.g., 265.8) will be explained below. 

The Trigram 2 Records are generated for the purpose of guessing the second letter in the string. The 
argument to the left of the dash is D47 or D4S. 

10 The Trigram 2 records contain one known first letter and two numeric digits representing two unknown 
letters, the second and third. The I to the right of the dash is the guess for the digit 4 of the 47 in this 
particular record. The expected occurrences per million words of text apply to this combination of letters 
and numbers. This frequency applies particularly to the I as a second letter and the string of D47 will have 
different statistics if the I is any letter other than the second in the string and if the first letter is other than a 

is D and the third digit in the string is other than a 7. 

The Trigram 3 records, Trigram N records, and the Trigram F records have the same format. The 
argument is two known letters and one unkown digit. 

In the example of the Trigram 3 record, the argument is 17 to the left of the dash and the function is the 
letter S to the right of the dash. The S is the guess for the 7 and the 265.8 is as above. The statistic applies 

20 particularly for the DI7 where the 7 is the third digit of the string and the Dl are the first and second letters 
of the word. 

In the example of the Trigram M record, the argument is SC6 to the left of the dash and the function is 
the letter 0 to the right of the dash. The 0 is the guess for the 6 and the 265.8 is as above. The statistic 
applies particularly for the SC6 where the 6 is not the first, second, third or final digit of the string and the 
25 SC are not the first or second letters of the word nor are they the third and second from the last letter. 

Finally, there is the Trigram F record which is especially created for guessing the last letter of a word. 
This has the same format as a Trigram M record but the expected frequencies of the letters as 
corresponding to the digits are not the same at the end of a word as they are in the middle or beginning of 
a word. The statistic applies particularly for the ER9 where the 9 is the last digit of the string and the ER are 
30 the third and second from the last letter. 

Four examples - DISPARATE, DIRECTLY, FISSURE, AND EGREGIOUS are shown as the source of 
Trigram I records which match DISCOVERY on the digits 347 but which have different functions for the first 
letter and different values for the expected occurrences per million words of text. 

Processing the complete dictionary as described above produces five files - TRIGS I (Trigrams Initial), 
35 TRIGS 2 (Trigrams 2), TRIGS 3 (Trigrams 3), TRIGS M (Trigrams Middle) and TRIGS F (Trigrams Final). 
Each of these files is processed separately as follows: 

a. Sort on the first four characters (letters or digits). In the example of Trigram 2 of FIG. 2, sort on the 
D47 plus the letter character which is an S or P or R or Q. 

b. For all records having identical first four characters, tabulate (add up) the expected frequencies per 
40 million words and produce a single record with the total frequency for that combination of the first four 

characters. 

c. Resort the tabulated records on the primary key of the first three characters. Sort on the secondary 
key in descending order of the totalled expected frequency per million words. Five sets of records of this 
description are shown in FIGS. 4B and 4C. 

45 

The results of these operations are shown in FIG. 4A ready to be loaded into the program tables. For 
each three character string (Trigram) made up of digits or letters and digits, there is a record containing its 
absolute frequency in terms of expected occurrence per million words. They are sorted according to the 
three digit string value and within a given string value according to their frequency. These five files are 
so organized according to the DTMF button depression which requires a guess and according to its position in 
the string being processed. 

Having the frequencies available, it is then possible to compute the probabilities that an anticipated 
letter guess will be correct. This probability per letter can be concatenated with the probabilities for all of 
the letters in the word and the probability that the complete word will be correct can then be computed. If 
55 this probability exceeds some threshold, then the whole word can be spoken to the user instead of using 
the letter mode to speak the word one letter at a time. 

The probability that all the letters on a particular button contain the correct guess for a particular DTMF 
button depression is 1.0 in total. For example, the probability that the correct guess is A, or B, or C for a 



17 



EP 0 319 193 A2 



DTMF button depression of "2" is 1.0. The probability that a particular letter choice is correct is the 
frequency for that letter divided by the total frequencies for ail the letters on the button. In FIG. 4B in the 
example of Trigram I, the probability that D is the correct guess for the 3 of the 347 is the sum of 498.7 
plus 311.6 plus 202.7 which is 1013 divided into 498.7 which is .49. The probability that the complete word 
5 would be correct if spoken (instead of speaking one letter at a time) is the product of the probabilities for all 
of the letters in the word. 

In case one of the Trigram tables does not contain a searched entry, the system refers to a table of 
Digrams, DIGRAMS, which lists each of the twenty-six letters as the first letter of a pair combined with each 
of the twenty-six letters as the second letter of a pair. The argument to the table is a letter and a digit. The 

m letter is the last confirmed one of the current alphabetic output string and the number is the next one of the 
current numeric input string. In the example of RICHARD, if IC4 had no entry in TRIGS3, then the system 
looks in DIGRAMS with the argument C4. The table would yield the functions, e.g., I,H, and G, which are 
the possible guesses for the digit 4 preceded by the letter C listed in order of their expected frequency. 
The DIGRAMS table is based on "Chart Showing FREQUENCIES of English DIGRAMS" -Prepared by 0. 

75 Phelps Meaker which is an appendix to "Elementary Cryptanalysis". 

These frequencies are not included in the computer tables as they are incompatible with the 
computations used to arrive at the Trigram table frequencies. Also, the expectation of using a Digram table 
entry is very low as the Trigram tables contain all but the most improbable combinations, e.g., QZ is in the 
Digram table and is used if required but is not expected to be used frequently if at all. The frequencies of 

20 the Digram table are recognized only in their relative values, i.e., in the example of C4 above yielding l,H,G, 
the letter are presented in that order due to their relative frequencies. 

If the system uses a Digram table entry for letter guessing, then it will probably not attempt to guess a 
complete word at a time but will only guess a tetter at a time. On the basis of the low frequency of 
reference to a Digram, the system may assume the frequency to be zero and make a whole word guess on 

25 that basis. 

The system maintains a "Guessing Stage" when processing letters one at a time. This is a storage area 
where all of the letters for a given DTMF digit are assembled in the sequence in which they will be 
presented for guessing by the user. For example, if the digit to be guessed is a "2", then the stage will 
contain e.g., CAB which is the order of their frequency and the order in which they will be presented for 

30 guessing. The Stage is loaded by reference to the Trigram tables and possibly by reference to the Digram 
Tables. The stage is first loaded from the Trigram tables and if it contains all the letters for a DTMF digit, 
e..g, the three letters C,A, and B taken from the Trigram files then the load is completed. If a letter or letters 
is missing, e.g., the B because it was not present in the Trigram file, then the Digram file would supply the 
missing letter (not the frequency per million statistic. 

35 The Guessing Stage is used to ensure that each possible letter is presented to the user for approval or 
rejection once and only once and in the proper sequence. If the user rejects ALL the letters of the stage, 
the system makes the assumption that the letters were spoken too quickly and that the user did not have 
time to reject the incorrect letters. In this case, the system increases the time available to the user to 
perform rejection and the whole stage is repeated for the rejected letters. 

40 The user may reject a word at a time as well as a letter at a time. When a word is spoken to the user, 
the system waits a time-out period for a reject. If the system receives a reject (the user pushes the # 
button), it will present the next choice word to the user. For example, the system speaks the first choice 
THE. If the user rejects, it then speaks the second choice TIE. If no more words are available in the 
dictionary that match the current string (in this case, 843) then the system enters LETTERS mode. If no 

45 reject is received before the timer expires, the system assumes the word was correct. 

If the # button is pushed after the timer expires, the meaning to the system is CANCEL THE LAST 
WORD. The system cancels the last word and assumes that the user did not have enough time to reject in 
the normal manner. It then extends the time-out period as described above. 

Summarizing the improvements to letter-at-a-tjme guessing, the current patent discusses guessing 

so single letters as isolated events, e.g., to guess an 8, the system would offer T,U, and V. The improvements 
encompass guessing digits in context of their surrounding digits, in the context of previously confirmed 
guesses of preceding letters, and in the context of their position within the subject string. The devices 
employed to execute this improved guessing procedures are the Forward Dictionary 43a, Reverse 
Dictionary 43b, Trigram 43e and 43i and Digram files 43c and 43d and may n-grams 43j and 43n. 

55 There are a number of useful applications for the basic invention just described. 



Alphanumeric Paging 29 
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The paper 29 is connected to the system 100 of FIG. 1 by an alphanumeric pager computer 27 and a 
PTN access node 17. Presently, pagers 29, illustrated in FIG. 1, also known as "beepers" can be classified 
according to functionality in several levels. 

At the lowest level of functionality is the plain beeper 9. When the "beep" sounds, it tells the carrier of 
5 the beeper 29 to call his office. His office will then tell him who was trying to call him, the number to call, 
etc. 

The numeric pager 29 displays the next higher level of functionality. When this machine sounds a 
"beep", the pager 29 displays a digital readout of the caller's telephone number. The carrier can then call 
this person directly instead of calling his office to find out who the person is. 

w Pagers 29 may receive and store voice messages similarly to telephone answering machines. The 
alphanumeric pager 29 displays the highest level of functionality. It receives and stores alphanumeric 
messages such as "Flight cancelled. What do I do now?" To send such a message, the sender calls an 800 
(Inward Wide Area Telephone Service) number supplied by the paging company and dictates the message 
to an operator. The operator, using an alphanumeric keyboard, types the message into a computer 27 and 

75 the computer 27 sends the message to the recipient. 

This technology of this invention allows transmission of alpha paging messages without the assistance 
of an operator and without the requirement to call an 800 number. The TAN (telephone access node) 
receives the message from the sender and interacts with him to ensure that the message is entered 
correctly. In then sends the message via a data network (rather than a voice network) to the paging 

20 company's computer which in turn sends it to the recipient. Thus, the technology disestablishes two costs - 
the cost of the operator and the cost of the 800 call. 



Telex 24 and 26 

25 

It is generally accepted that an alphanumeric data entry device, such as a Telex terminal 24 and 26 
shown in FIG. 1, is required in order to send a message to a receiving Telex terminal 24 and 26. This 
technology obviates the need for a sender to have access to a Telex 24 or 26 or other alphanumeric input 
device. The sender can enter the message using a DTMF keypad communicating with a TAN. The TAN 
30 interacts with the sender ensuring the validity of the message. The TAN converts the message to Telex 
format and sends it to the destination Telex terminal 24 or 26. 



Electronic Mail 28 

35 " _ 

This application is similar to Telex 24 or 26 except that the DTMF telephone and TAN combination is 
substituted for a computer equipped with a modem. The caller can enter his message on a DTMF 
telephone without the need for using a computer. The TAN will transmit the message in the proper format 
to the recipient's E-Mail mailbox where it will appear to the receiving equipment as if it had been sent by 
40 communicating personal computer. 



Telegrams and Cables 24 or 26 

45 Telegrams and cables 24 or 26 are similar to Telex 24 or 25 messages except that the recipient does 

not have a Telex machine 24 or 26 to receive the message. Therefore, the message must be sent to a 

public telegraph office and then relayed to the recipient. 

The present method of sending a telegram is similar to the method of sending an alphanumeric page as 

described as above. The caller must dial an 800 number and dictate the message to an operator who types 
so it into a computer. The message is transmitted to the telegraph office nearest to the recipient. An operator 

in the destination office then calls the recipient and reads him the telegram. 

This technology eliminates the need for the caller to make an 800 call, eliminates the need for an 

operator to type the message, and eliminates the need for the operator at the destination office to call the 

recipient. Using the technology of this invention, the caller calls the TAN nearest him and enters the 
55 message which is validated by interaction with the TAN. The message is transmitted by the caller's TAN to 

the TAN nearest the recipient. This TAN dials out to the recipient and reads the message to him using the 

synthetic voice function of the TAN. 
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. Telephone Messages 

Most telephone calls are not completed on the first attempt because the recipient is not located next to 
his telephone when the call attempt is made. What typically happens is that the recipient's secretary or his 
5 answering machine 23 takes a message which is primarily the caller's name and telephone number. This is 
the first step in a phenomenon called "Telephone Tag". 

Rather than use expensive voice circuits just to leave a message with an answering machine or a 
secretary, the caller 10 can enter his message interacting with his local TAN. The local TAN sends the 
message to the recipient's TAN 20 which then dials out and delivers the message using the synthetic voice 
w function of the TAN for delivery by a voice mail box 25. In this way, the same result is achieved at a much 
lower cost because the message was transmitted using a packet transport network at costs of cents per 
thousand bytes instead of a voice network. Additionally, if the recipient 22 does not answer the TAN call 
and does not have an answering machine 23, the TAN will keep periodically attempting to deliver the 
message. Thus, the TAN substitutes for a recipient's answering machine 23. 

15 

Database Access 21 

Some examples of databases 21 which can be accessed using the present invention are telephone 

20 directory assistance, sports, weather, stock quotes, and entertainment, such as movie reviews, showtimes, 
and promotional messages. 

Databases 21 are typically accessed using a terminal or a PC or by a voice call to a human operator. 
Some databases 21 can be directly accessed by a telephone call to a computer but they require the use of 
a code book and an 800 telephone call. For example, if a caller wants a quote on IBM, he must make an 

25 800 call to the central database, he must look in his code book for the numeric code for IBM; e.g., 4112, 
and enter this on the DTMF keypad. The code book is required in this scenario because present systems 
only accept numeric (not alphabetic) input from a DTMF telephone. 

This invention allows operator unassisted access to a database 21 with a local telephone call to a 
nearby TAN. No code book is necessary as the caller merely enters, e.g., IBM, and interacts with the TAN 

30 to ensure that the message is received correctly. The database 21 is connected to the network 16 by a 
PTN access node 1 5. The TAN then queries the central database via a packet transport network data call 
instead of an 800 voice call. Thus, this invention eliminates the need for a PC, eliminates the need for a 
code book, and eliminates the need for an 800 telephone call. 

The United States Post Office nine digit zip code program is a specific example of a data base which 

35 could be made available for direct access using this invention. The database 21 comprises some 22 million 
records which are available to large volume mailers using computer matching software. The TAN can 
provide service to individuals desiring low volume retrieval of nine digit zips. The caller can access a TAN 
with a local telephone call and enter data such as state, city, street name and number, The local TAN will 
relay this information to the central database over a packet transport network. A central database computer 

40 will find the nine digit zip code and return it to the calling TAN which will then deliver it to the caller using 
the synthetic voice function. 



Mail Delivery 

45 

Messages entered via a TAN will be transmitted to the TAN nearest the recipient anywhere in the world. 
The TAN can be equipped to print on a printer 19 and mail the message to the recipient. This is especially 
useful for countries which have sparsely deployed telegraph offices. 

50 

FAX 

Messages entered via a TAN can be converted to facsimile format in the TAN or sent to a value-added 
network service provider which performs the conversion. In either case, the DTMF input message is 
55 delivered to the receiver's FAX machine. 

Figures 5A-5C illustrate a simplified, alternative embodiment of the invention similar to that described in 
my copending U.S. patent application serial no. 07/126,633 filed in the United States on November 30, 
1Q87. The alternative embodiment provides for guessing a word at a time through the use of a look-up 
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dictionary in which the words are stored in a data bank according to length, numeric equivalent and relevant 
frequency. The alternative embodiment lends itself best to unique words which do not have numerous 
numeric string equivalents. However the alternative embodiment is not as effective as the preferred 
embodiment of the invention in overall efficiency where the best features of full word guessing are 

5 combined with sophisticated work guessing in which words are guessed letter-by-letter. 

The method for building a string of tones for the simplified alternative embodiment is illustrated in 
Figure 5A. When the # button is depressed, the prompter 36 senses that the string is complete. Then it 
takes the complete string and looks it up in the word guessing dictionary 42. Using the look up function, to 
find words corresponding to the string of tones, the system guesses the words according to their probability 

w for that particular string of tones, and passes them to the speaker 48 for output to sender 10. The sender 10 
either denies the guess by pressing another # or accepts the guessed word by continuing with typing the 
next word, that is, by pressing any button but the # button. 

The correctly guessed words are appended to the current string of correctly guessed words and in this 
way, a complete message is constructed one word at a time. 

15 The word guessing dictionary 42 is illustrated schematically in detail in Figure 5B. Word guessing 
dictionary 42 is employed by the look up function diagrammed in Figures 5A to translate from DTMF strings 
to ASCII words. The input for the look up function is a string of DTMF digits. The output of the look up 
function is an ASCII word or words. The look up function is invoked by the prompter 36 of FIGURE 2B 
when the prompter 36 senses that the sender 10 has entered a # digit. This is the equivalent of a space bar 

20 on a typewriter. The acceptable digits used as input to the look up function are 2 through 9. These digits 
represent the twenty-six letters of the alphabet. The correspondence between the DTMF digits and the 26 
letters is exactly that inscribed on an ordinary DTMF keyboard button pad with the exception that the letter 
Q is assigned to the 7 digit and the letter Z is assigned to the 9 digit. 

Figure 5B is essentially a map of the layout of the word guessing dictionary 42 and a scheme for 

25 processing a string of DTMF tone digits to produce a guessed ASCII word. The guessed word is sent to the 
speaker 48 which produces an audio output to the sender 10. 

The map of Figure 5B proposes the ability of the look up function to find and guess words of size 2 to 
size 48 characters. In practice, most number combinations such as 22 have no words at all which 
correspond to them. 

30 The first task of the look up function is to remove trailing "one" digits which in this invention designate 
punctuation and the terminating # are counted. Then the look up function searches the base address table 
42a to find the base address of the table for the word of the string size just counted. In the example used 
herein with regard to the word THE, the string is taken to be 843 which in the map diagram of Figure 5B 
will be found in the size 03 table 42d being at address BA03. 

35 The conventions used in Figure 5B to describe the word guessing dictionary 42 organization are meant 
to be illustrative and not necessarily efficient. In practice, the implementation of the alternative embodiment 
of the invention would concentrate on the fact that most numeric strings have no such words at all which 
correspond to them. The colon (:) is used here to separate the arguments such as 843 from the first 
guessed word. The comma (,) is used here to separate the successive word entries for a given numeric 

40 string in the dictionary 42. The semicolon (;) is used here to terminate all the words for a given numeric 
string, e.g., 843. The right curly bracket (}) is used here to terminate all the entries for strings of a given 
size, e.g., 3 digits. There is no terminator for the whole dictionary as each table is independent of all the 
others. 

A feasible, if somewhat inefficient method of finding the THE in response to an 843 string is to (1 ) find 
45 the base address of the size 03 table in the base address table 42a (this address is shown as BA03); (2) 
search table BA03 for colons; and (3) inspect the 3 digit number preceding each colon until the string 843 is 
encountered. Then, the right side of the colon will have acceptable word guesses for the string 843 in the 
sequence of their probability. The actual probability of a word does not necessarily appear in the tables of 
the word guessing dictionary 42 illustrated in Figure 5B. It is present only by the implication of the 
50 sequence of the word within a given n digit numeric entry. The sequence of the words within a given n digit 
numeric entry implies the relative, though not absolute, frequency of the word. Each word in turn, will be 
retrieved and sent to the speaker 48 for audio output to the caller 10. The caller 10 will accept or reject the 
guessed words. If the caller rejects VIE, the last word in the table for 843, then the system will shift to 
single letter mode and will announce the shift to the sender 10. 
55 The single letter mode is equivalent of guessing one digit strings. The size 01 table 42b at address 
BA01 is shown in diagram 5B. Note that the 26 letters of the alphabet are contained in the size 01 table 42b 
in the sequence of their probability within a given numeric digit. 

Figure 5C discloses the mode arrangement choices for the DTMF multi-button pad. As previously 
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discussed, the DTMF keypad can be made to represent any number of different characters by a mode shift 
function analogous to the Figs/Lets key on a Telex keyboard. Figure 5C is an exemplary table of such an 
arrangement having four modes. Users of the system 100 would be supplied with overlays fitting over the 
DTMF keypad buttons. The overlay, or template, would display the mode arrangement choices. 

s There is an implied mode not shown in Figure 5C which is that used entering a single letter at a time. In 
this case, the entry of for example, the 2 button would cause a guess of A, then B, then C which are the 
relative frequency of these three letters in the English text. As another example, the depression of the 7 
button in a single letter mode would cause the prompter 36 to guess single letters in the sequence S, R, P 
and Q. If the sender 10 denied the Q, then the access node 12 assumes the number 7 and announces to 

10 the sender 10 that it is shifting to the numeric mode. Subsequently, all entries will be assumed to be 
numeric until the next mode shift. 

In mode 1, the Alpha mode, the 1 button represents the word terminators in addition to the # which 
means a space bar. After guessing the word, the prompter 36 will proceed to guess the terminator. The 
terminators are arranged in approximate order of their frequency. The terminators are, in order: period(.), 

)5 ccmma(,), question mark(?), colon(:), and semicolon(;). The zero (0) button is always the mode shift "UP" 1 
button. The mode shift is always UP - 1, 2, 3, 4 followed by 1. The shift is cyclic. The mode shift key is 
always available from any mode. The sender 10 or the prompter 36 may shift modes. Whenever either the 
sender 10 or the prompter 36 shifts modes, the prompter 36 will announce the new mode. The # is always 
the terminator button. 

20 There are many other clues but they do not appear to be necessary to implement in order to make the 
alternative system usable. For example, after a string of consonants when answering the letters one at a 
time, there will be a requirement to enter a vowel. This is a valuable tool that could be implemented for the 
prompter 36 but it is not necessary because of the fortunate arrangement of the letters on the buttons. Each 
vowel is on a different button and it is also the most likely guess for the letter on the button except for the 

25 vowel U which shares the 8 button with the T which appears more frequently in English text. The 
assumption is that the relative frequency of the appearance of letters in the English text is ETAONISHR- 
DLU... and so forth. 

Note in Mode 3, the miscellaneous mode, that the symbols for add( + ), subtract(-), multiply(x), divide(-), 
dollar sign(), and equals( = ) occupy individual buttons by themselves. Buttons 1 and 2 have a collection of 

30 symbols of low expected usage. Note that there is a # followed by the ampersand (&) symbol on the 2 
button in Mode 3 which is not to be confused with the # button meaning space. 

Special modes can be constructed for foreign languages. In addition, special tables can be constructed 
and saved on mass storage for closed user groups, i.e. a group of people or businesses with a particular 
set of communication requirements in common. For example, a national group of florists might have a 

35 special directory of numeric strings and their interpretation that might vary from that used by a casual user. 
For example, a special numeric string might identify a special arrangement of flowers to be delivered at a 
particular location. When the sign-on and password protocol is completed, alternative system will recognize 
the sender 10 password as belonging to a particular closed user group and call in the special Dictionary 40 
to be used during that session. 

40 It is anticipated that each sender 10 will have his or her own unique subscriber identification number 
and password in order to access the alternative system similar to the manner in which conventional data 
base systems are accessed. 

While the invention has been described with reference to the preferred embodiment thereof, it will be 
appreciated by those of ordinary skill in the art that modifications can be made to the parts and steps of the 

45 invention without departing from the spirit of the invention as a whole. 



Claims 

so Claim 1. A telecommunications system for sending messages from a sender having a telephone with 
tone pushbuttons connected to a conventional telephone network which in turn is connected to a receiver, 
said system comprising: 

a telephone line interface connecting said system to the telephone network of the sender; 
a word guessing means for identifying words as they are spelled out on the sender's telephone; 
55 a prompter means connected to said telephone line interface and to said word guessing means for 
accessing said word guessing means; and, 

speaker means connected to said prompter and to said telephone line interface for speaking the words 
guessed from said word guessing means to the sender. 
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Claim 2. The system of claim 1 wherein said word guessing means includes a word guessing dictionary 
means for storing words first according to their letter length, second according to their tone number 
equivalence and third according to their relative frequencies. 

Claim 3. The system of claim 2 further comprising: 
5 storing means connected to said prompter means for storing words and assembling said words into a 
message prior to transmission as non-analog data over said telephone network. 

Claim 4. The system of claim 3 further comprising: 
header means connected to said prompter means for generating a routing header for each message, 
wherein said message header directs said message to the receiver, 
ro Claim 5. The system of the claim 4 wherein said speaker means further includes: 

text-to-speech synthesizing means for converting the data transmitted over the telephone network into 
audible speech at the receiver's end of the telephone network. 

Claim 6. The system of claim 5 wherein said telephone network includes a packet transport network, 
said system further comprising: 
75 packet transport network access means for connecting said system to said packet transport network. 

Claim 7. The system of claim 6 further comprising: 
mode shift means for automatically shifting the mode of the system from a word guessing mode to a letter- 
by-letter guessing mode after the sender has exhausted all possible entries in the word guessing dictionary 
means for a word of a given predetermined length and for a given tone combination sequence. 
20 Claim 8. The system of claim 7 further comprising: 
tone storage connected to said prompter means. 

Claim 9. The system of claim 1 wherein said word guessing means further includes: 
a full word dictionary means for storing numeric strings corresponding to common words and for identifying 
words corresponding to a numeric string; 
25 a forward dictionary means for storing words and parts of words according to length and frequency; and, 
a reverse dictionary means for storing words and parts of words with their sequence reversed and 
according to length and frequency, 

wherein said numeric string is compared against words and parts .of words in said forward and reverse 
dictionary means if said word is not found in said full word dictionary means. 

so Claim 10. The system of claim 9 further comprising: 

letter subgroup examining means for examining said numeric string in groups of n, where n equals at least 
2 in order to identify individual letters according to their frequency within said numeric string, 
wherein said numeric string is examined by said letter subgroup examining means if said word is not found 
in said full word dictionary means, said forward dictionary means, and said reverse dictionary means. 

35 Claim 11. The system of claim 10 wherein letter subgroup examining means includes: 
an n-gram-initial file for identifying the first letter of said word; 
an n-gram-2 file for identifying the second letter of said word; 
an n-gram-3 file for identifying the third letter of said word; 

an n-gram-M file for identifying any letter that is not the first, second, third or last letter of said word; and, 
40 an n-gram-F file for identifying the final letter of said word. 
Claim 12. The system of claim 1 1 wherein n equals 3. 
Claim 13. The system of claim 12 further comprising: 
at least one digram file for storing letters according to their frequency in groups of 2 and for comparing said 
groups of 2 against subgroups of said numeric string. 
45 Claim 14. A method for determining the identity of a word for transmission over a telephone network, 
said method comprising the steps of: 

entering said word on a tone pushbutton pad where at least one pushbutton is identified by a number that 
represents more than one letter; 

forming a string of numbers from said letters entered on said tone pushbutton pad; and 
so comparing said string of numbers against a word guessing dictionary to determine the identify of said word, 

wherein words stored in said word guessing dictionary are stored first according to their letter length, 

second according to their tone number equivalence and third to the relative frequencies. 

Claim 15. The method of claim 14 wherein if said word s not found in said word guessing dictionary 

then said string of numbers is compared against a forward dictionary in which numeric strings correspond- 
55 ing to words are stored according to length and frequency and also comparing said string of numbers 

against a reverse dictionary in which the sequence of letters in a word is reversed and said reversed words 

are stored according to length and frequency. 
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Claim 1 6. The method of claim 1 5 further comprising the step of: 
if said word is not identified according to the steps above, then examining said string of numbers in groups 
of n where n equals at least 2 in order to identify individual letters according to their frequency within said 
groups. 

5 Claim 17. The method of claim 16 in which n equals 3 and in which said word is examined in 5 groups 
of 3 corresponding respectively to groups including the first letter of a word, groups corresponding to the 
seccnd letter of the word, groups corresponding to the third letter of the word, groups corresponding to 
middle letters of the word, and groups corresponding to the final letter of the word. 
Claim 18. The method of claim 17 further comprising the step of: 
10 examining said word in subgroups of 2 letters if the foregoing steps have not identified said word. 
Claim 19. The method of claim 18 further comprising the steps of: 
storing said words as they are identified; 
assembling said words into a message; 

transmitting said words in non-analog data form over said telephone network; and 
75 synthesizing said non-analog data into speech as it is received from said telephone network. 

Claim 20. An apparatus for determining the identity of a word entered on the tone pushbuttons of a 
telephone as a numeric string of numbers, said apparatus comprising: 

a full word dictionary means for storing numeric strings corresponding to common words and for identifying 
words corresponding to a numeric string; 
20 a forward dictionary means for storing words and parts of words according to length and frequency; and 
a reverse dictionary means for storing words and parts of words with their sequence reversed and 
according to length and frequency, 

wherein said numeric string is compared against words and parts of words in said forward and reverse 
dictionary means if said word is not found in said full word dictionary means. 

25 Claim 21 . The apparatus of claim 20 further comprising: 

letter subgroup examining means for examining said numeric string in groups of n, where n equals at least 
2 in order to identify individual letters according to their frequency within said numeric string, 
wherein said numeric string is examined by said letter subgroup examining means if said word is not found 
in said full word dictionary means, said forward word dictionary means and said reverse dictionary means. 

30 Claim 22. The apparatus of claim 21 wherein said letter subgroup examining means includes: 
an n-gram-initial file for identifying the first letter of said word; 
an n-gram-2 file for identifying the second letter of said word; 
an n-gram-3 file for identifying the third letter of said word; 

an n-gram-M file for identifying any letter that is not first, second, third or last letter of said word; and 
35 an n-gram-F file for identifying the final letter of said word. 

Claim 23. The apparatus of claim 22 wherein n equals 3. 
Claim 24. The apparatus of claim 23 further comprising: 

at least one digram file for storing letters according to their frequency in groups of 2 and for comparing said 

groups of 2 against subgroups of said numeric string. 
40 Claim 25. The apparatus of claim 24 further comprising: 

a prompter means connected to said apparatus for accessing said word guessing dictionary means; and 

speaker means connected to said prompter means for speaking the word guessed by said apparatus to the 

sender. 

Claim 26. The apparatus of claim 25 further comprising: 
45 message storing means connected to said prompter for storing words verified as correct by the sender and 
assembling words into a message prior to transmission of said words as non-analog data over a telephone 
system. 

Claim 27. The apparatus of claim 26 further comprising: 
Claim 33. The apparatus of claim 27 further comprising: 
so E-mail means for receiving messages identified by said apparatus and for transmitting same to remote 
recipients. 

Claim 34. The apparatus of claim 27 further comprising: 
telex means for receiving messages comprised of words identified by said apparatus and for transmitting 
said messages to remote recipients. 
55 Claim 35. The apparatus of claim 27 further comprising: 

cable means for receiving messages comprised of words identified by said apparatus and for transmitting 
said messages to remote recipients. 
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Claim 36. The apparatus of claim 27 further comprising: 
fax means for receiving messages comprised of words identified by said apparatus. 

Claim 37. The apparatus of claim 27 further comprising: 
message string means for storing messages comprised of words identified by said apparatus, 
5 wherein said apparatus compares number strings against said message storing means in order to identify 
subsequent occurrences of said number string. 
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FIG. 4C 
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D4Q-I67.7 

(D4Q) (NO ENTRIES FOR D4Q-G, D4Q-H, D4Q-I) 

D4R-I132.5 

D4R-H31.4 

D4R-G12.3 

D4S-I87.7 

D4S-H32.1 

D4S- (NO ENTRY FOR D4S-G) 



TRIGRAM 3 0I7-S416.5 USED TO GUESS THE 7 OF THE DI7- 
DI7-R401 4 THE THIRD DIGIT OF THE STRING. 

DI7-P306.5 
0I7-Q061.6 

TRIGRAM M SC6-0519.1 used to guess the 6 of the SC6- 

SC6-M401.1 A DIGIT SOMEWHERE IN THE MIDDLE 
SC6-N336.8 OF A STRING. 

TRIGRAM F ER9-Y303.6 USED TO GUESS THE 9 OF THE ER9- 
ER9-W209 7 THE FINAL DIGIT OF THE STRING. 
ER9-Z188.8 
ER9-X103.9 
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FIG. 5 A 
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WAIT 



LISTEN 
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SENDER 10 CONFIRMED 
THIS GUESS 



SAVE THIS GUESS 
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THIS GUESS 
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GUESS 



CONTINUE 



DELETE AND SAVE 
FINAL 1 DIGITS WHICH 
DEFINE PUNCTUATION 



LOOK UP STRING IN 
WORD GUESSING DICTIONARY 42 
SELECT WORD 

LOOK UP ACCORDING TO: 

1. STRING LENGTH (e.g. 3) 

2. STRING NUMERIC (e.g. 843) 
EXAMPLE: FOR LENGTH = 3. 
NUMERIC = 843, 

FIRST GUESS = THE 
SECOND GUESS = TIE 
THIRD GUESS = VIE 



SPEAKER 48 SPEAKS 

GUESSED WORD 
TO THE SENDER 10 
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FIG. 5B 



42A 



BASE AORESS TABLE 
SIZE BASE ADDRESS 



01 
02 
03 



BA01 
BA02 
BA03 

'BA48 



rABLE-BA02 SIZE 02 



42C 



22: 




23: 


BE; 


24: 


AH; 


25: 




26: 


AN, AH; 


27: 


AS; 



99lL 



£ 



42X 



TABLE-BA48 SIZE 48 


2222222222222222. 


...2 


2222222222222222. 


...3 


2222222222222222. 


...4 


9999999999999999. 


...9 



r 



42 



£ 



42B 



TABLE-BA01 


SIZE 01 


2 


A.B.C 




3 


E.D.F 




4 


I.H.G 




5 


UK. J 




6 


0, NLM 




7 


S.R.P 


Q; 


8 


T.ULV 




9 


Y. W, Z, X} 



TABLE-BA03 SIZE 03 



X 



42D 



222: 
223: 
224: 
225: 



CAB; 

BAD, CAD; 
BAG, BAH; 



843: THE, TIE, VIE; 
999: ) 
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PREFERRED MODE TABLE FOR DTMF KEYPAD 




ITTON 


MODE 1 

1 IUUL 1 


MODE 1 


MODE 3 


MODE 4 




ALPHA 


NUMERIC 


MISC 




1 


r 


l 


i o"" 


READ MSG 


2 


ARP 


p 




END MSG 


3 


DEF 


3 


$ 


CANCEL MSG 


4 


GHI 


4 


* 


READ PARA 


5 


JKL 


5 


/ 


SPARE 


6 


MNO 


6 


+ 


SPARE 


7 


PQRS 


7 




SPARE 


8 


TUV 


8 


% 


SPARE 


9 


WXYZ 


9 




SPARE 


0 


BACKSPACE 


0 


BACKSPACE 


BACKSPACE 


X 


MODE SHIFT UP i 








# 


SPACE 









